Back

DNA Research

Oxford University Press (OUP)

Preprints posted in the last 30 days, ranked by how well they match DNA Research's content profile, based on 23 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.

1
New chromosome-level haplotyped genome assemblies and annotation for the Japanese Quail (Coturnix Japonica)

Cabau, C.; Degalez, F.; Leroux, S.; Gourichon, D.; Serre, R.-F.; Vernette, C.; Donnadieu, C.; Iampietro, C.; Vandecasteele, C.; Pitel, F.; Klopp, C.

2026-05-14 genomics 10.64898/2026.05.12.724545 medRxiv
Top 0.1%
12.2%
Show abstract

The Japanese quail (Coturnix japonica) is a widely used model organism in developmental biology, genetics, and agriculture. Here, we present new, haplotyped, high-quality genome assemblies of the Japanese quail, generated using a combination of state-of-the-art sequencing technologies, including PacBio HiFi long reads, Oxford Nanopore sequencing, and Hi-C scaffolding. This assembly has a total length of 1.19 Gb, 80% of which is included in chromosomes, and is highly complete (BUSCO score aves_odb10: 97.3). Assembly metrics show a marked improvement in contiguity, with a significantly higher scaffold N50 and a lower number of contigs compared to the reference genome assembly. Remarkably, the assembly extends previously truncated chromosome ends, with 31 telomeres detected. In addition, we merged the existing Ensembl and Refseq annotations and obtained a combined set of 26,102 genes, of which 25,038 genes were successfully mapped on the improved assembly haplotype 1 (Cjap1.hap1). Together, these new genome assemblies and their enriched annotation provide a robust genomic framework for future research. They enhance our ability to investigate developmental processes, genetic and epigenetic inheritance, and host-pathogen interactions. Furthermore, they offer valuable insights for conservation genetics and sustainable breeding programs. This resource represents a critical step forward in leveraging the full potential of the Japanese quail as a model species in both basic and applied research.

2
Chromosome-level genome assemblies of the red algae Porphyra dioica and Porphyra linearis

Morcillo, J.; D hondt, S.; Lipinska, A.; Bouckenooghe, S.; Noyen, L.; Van de Vloet, A.; Vranken, S.; Knoop, J.; Leliaert, F.; De Clerck, O.

2026-05-16 genomics 10.64898/2026.05.14.725108 medRxiv
Top 0.1%
10.0%
Show abstract

As one of the earliest-diverging multicellular eukaryotic lineages, the bladed Bangiales (Rhodophyta) possess a deep evolutionary history with a central role in the multi-billion-dollar global seaweed aquaculture industry. Although North Atlantic representatives are emerging candidates for regional mariculture, the scarcity of high-quality genomic resources for these taxa hinders both fundamental research and commercial optimization. To address this, we present the first chromosome-level genome assemblies for two native European species: Porphyra dioica (150.44 Mbp) and Porphyra linearis (95.22 Mbp). By integrating Oxford Nanopore Technologies (ONT) long-read sequencing with Hi-C proximity ligation, we generated highly contiguous nuclear genomes resolved into five chromosomes. Structural gene models were predicted through the BRAKER3 pipeline, identifying 12,548 and 10,382 protein-coding genes for P. dioica and P. linearis, respectively. Subsequent homology-based functional annotation characterized 57.4% and 59.8% of these predicted proteins. Supplemented by circularized organellar genomes, these reference genomes provide a critical framework for future research, enabling comparative studies of Atlantic-Pacific divergence and facilitating the development of selective breeding programs for sustainable European aquaculture.

3
Highly contiguous reference genome assembly of the endangered Orces blue whiptail Holcosus orcesi

Pozo, G.; Cisneros-Heredia, D. F.; Barragan-Orbe, D.; Sanchez-Nivicela, J. C.; Arbelaez, E.; Torres, M.

2026-05-16 genomics 10.64898/2026.05.14.725226 medRxiv
Top 0.1%
6.2%
Show abstract

Holcosus orcesi, the Orces Blue Whiptail, is a Critically Endangered lizard endemic to the upper Jubones River basin in southern Ecuador. Restricted to a narrow elevational range within semi-arid Andean shrublands, it represents one of the few montane members of a predominantly lowland lineage. Here we present the first high-quality reference genome for H. orcesi, generated using Oxford Nanopore Technologies long-read sequencing. The assembly spans 1.68 Gb across only 91 contigs, with an N50 of 76.2 Mb and a BUSCO completeness of 96.8%, making it among the most contiguous and complete squamate genomes to date. Structural annotation predicted 25,682 genes, of which 85% showed homology to known proteins and 45% were assigned Gene Ontology terms. Repetitive elements accounted for 46.3% of the genome, with LINEs representing the predominant class. This genome provides a foundational resource for future evolutionary, comparative and conservation-genomic research of H. orcesi and other mountain reptiles, enabling studies of population genomics, local adaptation, and genomic erosion in isolated populations. By expanding the genomic representation of tropical montane reptiles, this work helps address longstanding phylogenetic and geographic gaps in global biodiversity genomics and provides a foundation for evidence-based conservation of H. orcesi and related taxa.

4
Gene model for the ortholog of Lst8 in Drosophila yakuba

Lawson, M. E.; Sanow, K. A.; Chetana, K.; Taylor, E.; Morgan, A.; Flannery, D.; Elsie, C.; Rele, C. P.; Reed, L. K.; O'Rourke, K. S.

2026-05-14 genomics 10.64898/2026.05.12.723325 medRxiv
Top 0.1%
4.8%
Show abstract

Gene model for the ortholog of Lst8 (Lst8) in the May 2011 (WUGSC dyak_caf1/DyakCAF1) Genome Assembly (GenBank Accession: GCA_000005975.1) of Drosophila yakuba. This ortholog was characterized as part of a developing dataset to study the evolution of the Insulin/insulin-like growth factor signaling pathway (IIS) across the genus Drosophila using the Genomics Education Partnership gene annotation protocol for Course-based Undergraduate Research Experiences.

5
A gap-free, telomere-to-telomere genome assembly for the Caenorhabditis briggsae reference strain AF16

O'Connor, L. M.; Moya, N. D.; Jhaveri, N. S.; Tanny, R. E.; Khorshidian, A.; Lyu, H.; Chamberlin, H. M.; Baird, S. E.; Andersen, E. C.

2026-05-04 genomics 10.64898/2026.04.30.721887 medRxiv
Top 0.1%
3.5%
Show abstract

The nematode Caenorhabditis elegans was the first metazoan to have its genome completely sequenced and assembled. Since that time, researchers have continuously updated the reference genome and manually curated its approximately 20,000 genes. The closely related species, Caenorhabditis briggsae, has served as a comparative model because of its similar morphology, mode of reproduction, and patterns of intra-species genetic variation. However, the genomic resources for C. briggsae lag behind C. elegans, hindering comparative genomics studies between the species. Decades of experimentation have been performed in the AF16 reference strain genetic background, so we obtained high-coverage long-read sequencing and high-throughput chromosome conformation capture data to create an updated reference genome for an isogenic derivative of AF16, named CGC2. The CGC2 genome is vastly improved relative to the existing AF16 assemblies, with no unplaced sequence, no gaps, and telomere-to-telomere contiguity. To provide genomic resources for CGC2, we exploited deep RNA-seq libraries from all developmental stages to predict protein-coding gene annotations comparable in accuracy and completeness to the existing AF16 gene models. We also performed lift-over of 108 validated insertion-deletion variants to the updated coordinate system of the CGC2 genome to facilitate future mappings of mutations. In summary, we present an updated reference genome for the canonical AF16 reference strain with improved genomic resources to enable high-quality intra- and inter-species comparative studies.

6
Near chromosome-level genome assembly for the invasive annual forb Centaurea melitensis

Dant, A.; Pelosi, J.; Northing, P. C.; Dlugosch, K. M.

2026-05-20 genomics 10.64898/2026.05.18.726060 medRxiv
Top 0.2%
1.8%
Show abstract

PremiseCentaurea melitensis (Asteraceae) is a problematic invader of grasslands globally, but little is known about its genetic makeup. Here we develop a reference genome to facilitate studies of its invasion history, genetic variation, and evolution. MethodsInbred offspring of a single individual of C. melitensis from its invasion of California, USA were used for flow cytometry to estimate genome size, and for genomic DNA extraction. DNA was sequenced with PacBio HiFi technology (yield = 85.7Gb). The genome was assembled with Hifiasm and annotated with BRAKER3. GENESPACE was used to compare gene order (synteny) with three other species within the subfamily Cichorioideae. ResultsWe estimated a mean genome size of 795.0 Mbp for C. melitensis, and our assembly totaled 696.6 Mbp in 48 contigs (N50 = 55.6 Mbp; BUSCO = 98%), with annotation of 25,157 protein-encoding genes. This included four telomere-to-telomere putative chromosomes, nine additional chromosome arms terminated by telomeric repeats, and a complete chloroplast genome. Synteny varied markedly across the genus and subfamily, suggesting a dynamic history of structural variation in the lineage of C. melitensis. DiscussionWe provide a highly complete and contiguous genome assembly to facilitate the further study of genomic variation in C. melitensis.

7
YY1 Binding Motif at Upstream of Rep/Cap Increases AAV Yield and Full Capsids

Ofusa, Y.; Nishio, S.; Enoki, T.; Mineno, J.; Ozawa, K.; Mizukami, H.; Ohba, K.

2026-05-22 microbiology 10.64898/2026.05.21.726733 medRxiv
Top 0.2%
1.7%
Show abstract

Adeno-associated virus (AAV) vectors are widely used in gene therapy, whereas low manufacturing efficiency and a large proportion of empty capsids are major obstacles. This study focused on the Yin Yang 1 (YY1) binding motif (YY1-motif) and investigated the effect of its presence or insertion at upstream of the Replicase (Rep)/Capsid Cap) gene on AAV vector production. We found that the YY1-motif incidentally presented in a Rep/Cap plasmid was associated with high vector production. We then designed several modified Rep/Cap (RC2) constructs. The YY1-motif insertion at the upstream of Rep/Cap gene increased vector yield in a repeat-number-dependent manner, and similar effects were not observed with other promoters insertion. Furthermore, the insertion of the YY1-motif reduced the amount of Cap protein per the same amount of full particle in supernatants on multiple serotypes, indicating the improvement in the empty/full capsid ratio. The YY1-motif insertion did not affect the AAV vector infectivity. These results denote that the YY1-motif has a universal regulatory function that optimizes the Rep/Cap expression balance, and simultaneously improves the production efficiency and full particle formation of AAV vectors. This finding could contribute to the development of highly efficient and high-quality AAV manufacturing processes.

8
Chromosome-level genome assembly and annotation of the threatened marbled teal (Marmaronetta angustirostris)

Ortego, J.; Lopez-Luque, R.; Backstrom, N.; Green, A. J.

2026-05-14 genomics 10.64898/2026.05.12.723956 medRxiv
Top 0.3%
1.7%
Show abstract

The marbled teal (Marmaronetta angustirostris) is a widely distributed but declining waterfowl species, classified as Near Threatened globally and Critically Endangered in Spain. Despite ongoing conservation actions, including ex situ management and population reinforcement programmes, the genomic consequences of long-term captivity, inbreeding, and patterns of functional genetic variation remain unknown due to the absence of a species-specific reference genome. Here, we present the first chromosome-level genome assembly for this species. The genome was generated using PacBio HiFi long reads and Omni-C data, yielding a 1.15Gb assembly with a scaffold N50 of 76.95Mb. A total of 97.16% of the assembly was anchored into 36 chromosome-scale scaffolds, including the Z and W sex chromosomes. BUSCO analysis recovered 99.2% of conserved avian genes. Gene prediction was performed using both ab initio and homology-based strategies, resulting in 16,048 protein-coding genes. This resource provides a foundation for genomewide analyses of inbreeding, demographic history, and adaptive variation, and will support evidencebased in situ and ex situ conservation strategies for this threatened species.

9
Haplotype-resolved genome of autotetraploid alfalfa (Medicago sativa) Regen-SY27x uncovers large scale structural variation and resistance gene dynamics

Kaur, H.; Cameron, C. T.; Gomez, A.; Mudge, J.; Farmer, A.; Shannon, L. M.; Samac, D. A.

2026-05-05 genomics 10.64898/2026.05.01.722254 medRxiv
Top 0.3%
1.4%
Show abstract

Polyploid genome assembly presents unique challenges due to extensive heterozygosity and complex haplotype structure. We report a haplotype-resolved, chromosome-scale assembly of Regen-SY27x, a genotype of autotetraploid alfalfa (Medicago sativa), which is widely used for genetic modification because of its excellent regenerative capacity in tissue culture. Using PacBio HiFi long reads, Omni-C scaffolding, and linkage map guided phasing, we generated a 3.2 GB assembly comprising four haplotypes with high contiguity and completeness. Kmer-based validation confirmed accurate haplotype separation, while linkage map integration and dotplot analysis identified and corrected chimeric scaffolds. Gene annotation yielded 221,688 protein-coding genes, with more than 99% assigned to pseudochromosomes. Repetitive elements accounted for 62.7% of the genome, dominated by long terminal repeat retrotransposons and a high fraction of Helitrons. The spatial enrichment of Helitrons within gene-dense distal chromosome arms underscores their pivotal role as key drivers of genomic innovation and gene family expansion. We identified 3,696 nucleotide-binding leucine-rich repeat R genes, with Toll/interleukin-1 receptor-like and Rx-type subclasses forming large tandem clusters across haplotypes. Comparative analyses revealed strong macrosyntenic conservation among Regen-SY27x and the publicly available Chinese alfalfa genomes but extensive structural variation both within Regen-SY27x haplotypes and between Regen-SY27x and the Chinese genotypes with tens of thousands of duplications, inversions, and translocations detected. These results demonstrate that a single autotetraploid individual captures extensive structural diversity, but individuals from different populations vary greatly. The Regen-SY27x assembly provides a foundational genomic resource for investigating polyploid genome evolution and identifying genetic variation relevant to biological and agronomic improvement in alfalfa. Article SummaryThis study presents the first chromosome-scale, haplotype-resolved genome assembly of the US alfalfa germplasm, Regen-SY27x, a key alfalfa genotype used widely for genetic engineering. We integrated HiFi long reads, Omni-CTM scaffolding, and linkage map-guided phasing to reconstruct all four haplotypes of this complex autotetraploid. Our results identified 221,688 protein-coding genes and reveal immense intra-individual structural variations dominated by small duplications. This high-quality reference serves as a foundational tool for the alfalfa community, enabling researchers to link complex structural diversity with agronomic traits and further enhance the biotechnological potential of this essential forage crop.

10
Efficient and Robust Genomic DNA Isolation and Next-Generation Sequencing Library Preparation from Recalcitrant Wild Grape Species

Bhattarai, A.; Smith, J.; Abdelgaffar, H.; Carpenter, R.; Mishra, S.; Fuentes, J. L. J.; Shirsekar, G.

2026-05-21 genomics 10.64898/2026.05.19.713680 medRxiv
Top 0.3%
1.2%
Show abstract

This protocol details the extraction of high-molecular-weight genomic DNA from grapevine tissues (wild and cultivated Vitis spp., including pathogen-infected samples) and the subsequent preparation of Illumina(R) whole-genome sequencing libraries using bead-bound Tn5 transposase. It is designed to overcome challenges from polyphenolic compounds and secondary metabolites in wild plants, providing a cost-effective workflow for large-scale population genomics. It includes recipes for buffers, incubation times, critical notes, and troubleshooting tips to maximize yield and library quality. Although designed for the grapevine DNA, this protocol is potentially applicable to other similar wild plant species HighlightsO_LIOptimized CTAB-PTB DNA extraction protocol for field-collected wild plant tissues. C_LIO_LIEffective removal of polyphenols and secondary metabolites associated with DNA using PTB. C_LIO_LICost-effective Illumina DNA Prep library preparation using bead-bound Tn5 transposase (Tagmentation). C_LIO_LIScalable workflow suitable for large-scale population genomics in Vitis species. C_LIO_LIValidated method for high-molecular-weight DNA and high-quality sequencing data. C_LI Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=195 SRC="FIGDIR/small/713680v1_ufig1.gif" ALT="Figure 1"> View larger version (31K): org.highwire.dtl.DTLVardef@b637d4org.highwire.dtl.DTLVardef@10b563aorg.highwire.dtl.DTLVardef@14a32caorg.highwire.dtl.DTLVardef@4c9577_HPS_FORMAT_FIGEXP M_FIG C_FIG

11
Measles Whole Genome Sequencing by an Illumina Tiled Amplification Method

Zubach, V.; Ashfaq, S.; Van Driel, S.; Kaplen, B.; Peters, G.; Laminman, V.; Go, A.; Bonner, C.; Graham, M.; Hiebert, J.

2026-05-16 genomics 10.64898/2026.05.13.724913 medRxiv
Top 0.4%
0.9%
Show abstract

Measles virus remains a significant global health threat, and despite the availability of an effective vaccine, measles cases continue to increase worldwide in recent years. Genomic surveillance has become an essential tool for monitoring virus circulation and investigating outbreaks. Here, we describe a wet-laboratory method for whole-genome sequencing of measles virus using a tiled amplicon approach and Illumina sequencing technology. A previously published Oxford Nanopore-based tiled primer scheme was adapted to include both circulating measles genotypes and for use on the Illumina platform. Two Illumina library preparation kits, Illumina DNA Prep (IDP) and Nextera XT (XT), were evaluated for performance. The IDP kit demonstrated more complete genomes and consistent genome coverage compared with XT. Using quantified reference genomes, the limit of detection was determined to be 10,000 genome copies for genotype B3 and D8. Sequence accuracy was evaluated using previously characterized clinical samples and showed high concordance. This method provides a reliable and sensitive approach for measles virus whole-genome sequencing using Illumina platforms and is suitable for genomic surveillance applications.

12
Genome-wide identification of rhabdoviral sequences in alfalfa (Medicago sativa L.)

Grinstead, S.; Nemchinov, L. G.

2026-05-22 genomics 10.64898/2026.05.20.726541 medRxiv
Top 0.4%
0.9%
Show abstract

We recently reported the identification of endogenous viral elements (EVEs) originating from the Caulimoviridae family within the alfalfa (Medicago sativa L.) genome. Our subsequent identification of ubiquitous rhabdoviral elements in infected and healthy alfalfa tissues by high throughput sequencing prompted us to suggest that the alfalfa genome might be populated with integrated rhabdoviruses as well. Bioinformatics analysis using 26 publicly available alfalfa genomes proved the suggestion accurate. We found multiple non-retroviral segments of the Rhabdoviridae family belonging to the genera Betanucleorhabdovirus and Betacytorhabdovirus that appeared to be stable constituents of the host genome. In that capacity they could potentially acquire functional roles in alfalfas development and response to environmental stresses. We believe this study reveals the first documented case of rhabdoviruses integrated into the alfalfa genome.

13
Increasing the shelf life of tomato fruit by editing the β-D-N-acetylhexosaminidase (β-hex) gene using CRISPR/Cas9 technology.

Murodov, A. A.; Ayubov, M. S.; Mirzakhmedov, M. K.; Obidov, N. S.; Mamajonov, B. O.; Yusupov, A. N.; Bashirxonov, Z. H.; Kamalova, L. K.; Kushakov, S. O.; Bozorov, I. E.; Buriev, Z. T.; Abdurakhmonov, I. Y.

2026-05-05 molecular biology 10.64898/2026.05.01.722371 medRxiv
Top 0.4%
0.8%
Show abstract

Obtaining tomato plants with firm and intact fruit is one of the main goals in tomato breeding programs. Achieving these goals through conventional breeding is time-consuming and can lead to the loss of unwanted traits. In other hand, consumers are concerned about the presence of transgenic elements in plants acquired through RNA interference. The use of CRISPR/Cas9 technology has made it possible to overcome the above-mentioned shortcomings. In this study, the {beta}-D-N-acetylhexosaminidase ({beta}-hex) gene, which is involved in tomato fruit ripening, was knocked out using CRISPR/Cas9. In the resulting mutant plant genome, an indel mutation was found in exons 1 and 2 of the {beta}-hex gene. Plants with a mutation in their genome were observed to have increased fruit firmness and shelf life compared to control plants without affecting fruit quality.

14
Genome sequence of the medicinal plant Urtica dioica reveals the genetic basis of the flavonoid metabolism

Wolff, K.; de Oliveira, J. A. V. S.; Fuerstenberg, L.; Hagedorn, M.; Garz, B.; Borchert, M.; Pucker, B.

2026-05-16 plant biology 10.64898/2026.05.15.725508 medRxiv
Top 0.6%
0.6%
Show abstract

BackgroundUrtica dioica, also known as stinging nettle, is a widespread plant that can indicate high nitrogen availability in the soil. It is probably best known for the pain caused by touching it. U. dioica is also recognized as a medicinal plant with reports claiming applicability against numerous diseases. ResultsA highly continuous genome sequence was constructed based on nanopore long read sequencing data. The total assembly size is 1.1 Gbp with an N50 of 40.7 Mbp. RNA-seq data and hints from other species were integrated to produce a high quality annotation of the protein encoding genes. This genomic resource enabled the identification of genes involved in the flavonoid biosynthesis. A particular focus was on anthocyanin biosynthesis genes as these are crucial for high light and nitrogen deprivation stress response, which is revealed by redding of the leaves. ConclusionThis genomic resource provides the basis for future studies unraveling the biosynthesis pathways underlying various medically important compounds produced by stinging nettles.

15
Single amino acid substitution in DNA Polymerase I dramatically alters infection dynamics of bacteriophage T7

Keown, R. A.; Sikkema, A. P.; Barbone, V. A.; Ferrell, B. D.; Donnelly, O. B.; Iredell, S. C.; Zatopek, K. M.; Brumm, P. J.; Mead, D. A.; Lohman, G. J. S.; Wommack, K. E.; Polson, S. W.

2026-05-21 microbiology 10.64898/2026.05.20.726624 medRxiv
Top 0.6%
0.6%
Show abstract

Viruses constitute a significant proportion of Earths genetic diversity, yet most remain uncharacterized beyond their sequences in viral metagenomes. Linking viral genotypes to phenotypes--especially enzyme function to phage infection dynamics--is challenging due to the lack of cultured virus-host systems. DNA polymerase I (PolA), essential for genome replication in [~]25% of dsDNA phages, provides an opportunity to explore these connections. In phage T7, residue 526 is critical for nucleotide incorporation, with previous in vitro evidence indicating impacts on enzyme efficiency and fidelity. Previous analyses identified three substitutions at this position (Tyr/Y, Phe/F, Leu/L) linked with deeply rooted viral PolA clades. Mutation impacts at residue 526 were tested in vitro and in vivo. The Y526F protein exhibited a 50% reduction in specific activity, and when introduced via High Complexity Golden Gate Assembly into T7 demonstrated a 53% decrease in burst size and significantly longer latent period compared to wild type. The Y526L protein exhibited a 97% decrease in activity, and the Y526L phage was incapable of completing its lifecycle. These findings confirm historical biochemical data, provide in vivo context for these mutations in the T7-E. coli system, and offer experimental support for genotype-to-phenotype associations in viral PolA, informing viral metagenomics studies. GRAPHICAL ABSTRACT O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=162 SRC="FIGDIR/small/726624v1_ufig1.gif" ALT="Figure 1"> View larger version (31K): org.highwire.dtl.DTLVardef@14395e5org.highwire.dtl.DTLVardef@261504org.highwire.dtl.DTLVardef@2dc1e4org.highwire.dtl.DTLVardef@147a7f_HPS_FORMAT_FIGEXP M_FIG C_FIG Created in BioRender. Keown, R. (2026) https://BioRender.com/mhrmup3

16
Rubus armeniacus genome sequence reveals the secrets of blackberry anthocyanin biosynthesis

Wolff, K.; Nowak, M. S.; Thoben, C.; Beuerle, T.; Pucker, B.

2026-05-10 genomics 10.64898/2026.05.05.723051 medRxiv
Top 0.6%
0.6%
Show abstract

Here, we present a comprehensive multiomics analysis of anthocyanin biosynthesis in Rubus armeniacus, known for its dark fruits. A phased genome sequence of the tetraploid blackberry was generated, achieving an N50 of 34 Mb with an assembly size of 1.2 Gbp based on Oxford Nanopore Technology sequencing (ONT). The BUSCO score for the total assembly shows a high completeness of 99.1%. The assembly was separated into 4 pseudohaplophases, with the pseudohaplophase A representing the R. armeniacus genome in 7 chromosome scale contigs, with an N50 of 46 Mbp and 98.8% conserved BUSCO genes. A total of 118,183 protein coding genes were annotated within the genome assembly and all relevant genes encoding enzymes and transcriptional regulators of the anthocyanin biosynthesis pathway were identified within each pseudohaplophase. To further understand the underlying cause of dark pigmentation, the gene expression was analysed during different stages of berry development revealing a strong induction of anthocyanin biosynthesis genes including the anthocyanin activating subgroup 6 MYB transcriptions during the berry ripening process. Further, a quantification of cyanidin-3-O-glucoside in methanolic berry extract, utilizing a UHPLC-HRAM-MS analysis, revealed an approximately 500-fold increase of cyanidin-3-O-glucoside from green to black fruit, indicating that dark pigmentation in R. armeniacus results from high anthocyanin accumulation. Significance statementThis study provides a multiomics analysis of the dark pigmentation of Rubus armeniacus, including a high quality phased assembly and an in-depth analysis of the anthocyanin biosynthesis pathway. A transcriptional and metabolomic analysis revealed that dark berry pigmentation is caused by a high accumulation of cyanidin-3-O-glucoside during fruit ripening.

17
Rapid and flexible assessment of gene functions in plant cells with particle bombardment and linear DNA

Weerasinghe, P. R.; Tsugama, D.

2026-05-18 plant biology 10.64898/2026.05.17.725698 medRxiv
Top 0.6%
0.5%
Show abstract

Biolistic transformation is a versatile tool in plant science, yet high equipment costs and tissue damage from high-pressure gas remain significant barriers. Building on our previously developed "TSGMAC", a low-cost, helium-free biolistic system, we report three major advancements to enhance its throughput, delivery quality, and quantitative capability. First, a "guide barrel" assembled from commercial DIY fittings was developed; it effectively eliminates physical tissue damage and ensures uniform particle distribution, even in soft tissues like bok choy (Brassica rapa subsp. chinensis). Second, a rapid gene expression platform using PCR products was characterized. Results demonstrate that linear DNA constructs are efficiently circularized via non-homologous end joining (NHEJ) in plant cells, and protein expression is robust regardless of the relative positions of the promoter, coding sequence, and terminator. This system bypasses time-consuming cloning. Third, a cost-effective, highly sensitive dual-luciferase assay system utilizing teal Luc (teLuc) and inexpensive firefly luciferase (FLuc) inhibitors was established. This integrated workflow enables rapid, quantitative molecular biology using supermarket-obtained materials and standard PCR reagents. Our findings provide a practical foundation for plant scientists, synergistically accelerating gene functional analysis and genetic tool development.

18
A historical specimen of False Lingzhi (Ganoderma lucidum) resolves a 245-year-old confusion within an important medicinal mushroom group

Varga, T.; Parker, S. R.; Agorini, A.; Dombrowski, A.; Hadfield, L.; Ainsworth, A. M.; Hawksworth, D. L.; Ghobad-Nejhad, M.; Papp, V.

2026-05-14 evolutionary biology 10.64898/2026.05.13.724775 medRxiv
Top 0.6%
0.5%
Show abstract

O_LIPlants and fungi are major sources of natural products beneficial to society, making the study of distinct species essential for discovering new drugs and bioactive compounds. The medicinal mushroom "Lingzhi" or "Reishi" (Ganoderma lingzhi) is widely used in traditional medicine and extensively studied for its bioactive triterpenoids, yet it is commonly identified as Ganoderma lucidum, the type species of the genus, which lacks a type specimen. C_LIO_LIWe sequenced a G. lucidum specimen preserved in the Kew fungarium, which agreed with the original description and was collected from wood of Corylus avellana in southern England. Using this reference specimen, we compiled genomic and ITS barcoding datasets to explore the genetic and geographic variation within this species. C_LIO_LIWe showed that G. lingzhi and G. lucidum diverged more than 12 million years ago and that all seven "G. lucidum" genomes deposited in public databases belong to other species. More than 1000 barcoding sequences showed that the widely used homology-based ITS barcoding is not working in this group, which can be mitigated by a phylogenetic placement approach. The 149 sequences assigned to G. lucidum with high confidence showed a Eurasian distribution and introductions to North and South America and Africa. C_LIO_LIOur study underscores the importance of accurate species identification and provides guidance for a group of pharmaceutical and socially significant species. To further support future studies and the wider public in differentiating between G. lingzhi and G. lucidum, we propose using "False Lingzhi" as the English name for G. lucidum. C_LI Societal Impact StatementTraditional Chinese Medicine has expanded far beyond Asia, with growing markets in North America and Europe for supplements and functional foods. Lingzhi or Reishi (Ganoderma lingzhi), a well-known medicinal mushroom, is valued for its anti-inflammatory and anticancer properties. However, it is often misidentified with species that may not provide the same health benefits. This confusion poses risks to consumer safety, product regulation, and research. Here, we establish a reference using morphological and molecular tools for the most commonly misidentified species (Ganoderma lucidum) and propose the name "False Lingzhi" to distinguish it, supporting accurate identification, safer product development, and reliable research.

19
Fast-annealed 3'-extended dsDNA templates facilitate efficient epitope-tag knock-in in emerging model insects

Nakamura, T.; Ando, T.; Matsuoka, Y.; Niimi, T.

2026-05-20 bioengineering 10.1101/2025.06.20.660821 medRxiv
Top 0.7%
0.5%
Show abstract

CRISPR-Cas genome editing toolkits have expanded the scope of genetic studies in various emerging model organisms. However, their applications are limited mainly to knockout experiments due to technical difficulties in establishing knock-in strains, which enable in vivo molecular tagging-based experiments. Here, we investigated knock-in strategies in the harlequin ladybug Harmonia axyridis, a model insect for evolutionary developmental biology, which shows more than 200 color pattern variations within a species. We tested several knock-in strategies using synthetic DNA templates. We found that ssDNA templates generated founder knock-in strains efficiently (2.5-11%), whereas the 5 regions of ssDNA templates were frequently deleted when the insert length exceeded [~]40 bases. To overcome this limitation, we designed several 3 extended DNA templates. Fast-annealed 3-extended double-stranded DNA templates, which were designed for tagging endogenous proteins with epitope tags, showed high founder generation efficiency (9.9-20.9%) and accuracy (30.8-85.7%). This strategy is also applicable to the two-spotted cricket Gryllus bimaculatus, suggesting that the fast-annealed 3-extended dsDNA template is a versatile DNA template for generating knock-in strains in emerging model insects for developmental genetic studies. Summary statementFast-annealed 3-extended dsDNA templates facilitate efficient CRISPR-Cas9-mediated knock-in in emerging model insects.

20
Generation of promoters enabling high-level constitutive gene expression in both plants and Escherichia coli

Weerasinghe, P. R.; Tsugama, D.

2026-05-18 plant biology 10.64898/2026.05.17.725692 medRxiv
Top 0.7%
0.4%
Show abstract

Functional validation of genetic components in plants often requires cloning them separately into both plant and bacterial expression vectors, a process that is both time-consuming and laborious. This study aimed to simplify this workflow by developing plant-bacteria dual-host promoter systems that drive high-level constitutive expression in both environments. To achieve this, two variants of the chloramphenicol acetyltransferase promoter (PCAT), a bacterial {sigma} factor-dependent promoter, were integrated into the cauliflower mosaic virus 35S promoter (P35S), and their performance was evaluated using a hygromycin phosphotransferase (HPT)-GFP fusion reporter. One of these variants, PCAT1, conferred hygromycin resistance to Escherichia coli (DH5 and BL21 (DE3)) and maintained high-level expression comparable to the original P35S in onion epidermal cells. A hybrid P35S enhancer-PNOS system also conferred hygromycin resistance to E. coli, but its activity in inducing GFP signals in onion cells remained lower than that of P35S. Due to its compact size (89 bp) and efficiency, PCAT1 can serve as a module for converting standard plant vectors into dual-host systems, accelerating gene characterization and the development of new gene-based tools.